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Abstract 

Interference is usually viewed as an obstacle to communication in wireless networks. This paper proposes a 
new strategy, compute-and-forward, that exploits interference to obtain significantly higher rates between users in 
a network. The key idea is that relays should decode linear functions of transmitted messages according to their 
observed channel coefficients rather than ignoring the interference as noise. After decoding these linear equations, the 
relays simply send them towards the destinations, which given enough equations, can recover their desired messages. 
The underlying codes are based on nested lattices whose algebraic structure ensures that integer combinations of 
codewords can be decoded reliably. Encoders map messages from a finite field to a lattice and decoders recover 
equations of lattice points which are then mapped back to equations over the finite field. This scheme is applicable 
even if the transmitters lack channel state information. Its potential is demonstrated through examples drawn from 
cooperative communication including distributed MIMO and wireless network coding. 

Index Terms 

Relaying, cooperative communication, structured codes, nested lattice codes, reliable computation, AWGN 
' networks, interference. 
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I. Introduction 



In a wireless network, a transmission from a single node is heard not only by the intended receiver, but also by 
Q\ ' all other nearby nodes; by analogy, any receiver not only captures the signal from its designated transmitter, but 
1 ■ from all other nearby transmitters. The resulting interference is usually viewed as highly undesirable and clever 
^ ', algorithms and protocols have been devised to avoid interference between transmitters. Collectively, these strategies 
^ ' transform the physical layer into a set of reliable bit pipes, i.e. each link can accommodate a certain number of 
^ bits per time unit. These bit pipes can then be used seamlessly by higher layers in the protocol stack. 
Q\ \ Since wireless terminals must compete for the same fixed chunk of spectrum, interference avoidance results in 
^ ' diminishing rates as the network size increases. Recent work on cooperative communication has shown that this 
■ penalty can be overcome by adopting new strategies at the physical layer. The key idea is that users should help 
^ , relay each other's messages by exploiting the broadcast and multiple-access properties of the wireless medium; 
$— i ' properties that are usually viewed as a hindrance and are not captured by a bit pipe interface. To date, most proposed 
. cooperative schemes have relied on one of the following three core relaying strategies: 

• Decode-and-Forward: The relay decodes at least some part of the transmitted messages. The recovered bits 
are then re-encoded for collaborative transmission to the next relay. Although this strategy offers significant 
advantages, the relay is ultimately interference-limited as the number of transmitted messages increases [l]-[4]. 

• Compress-and-Forward: The signal observed at the relay is vector quantized and this information is passed 
towards the destination. If the destination receives information from multiple relays, it can treat the network as a 
multiple-input multiple-output (MIMO) channel. Unfortunately, since no decoding is performed at intermediate 
nodes, noise builds up as messages traverse the network [1], [3], [5]-[7]. 

• Amplify '-and-Forward: The relay simply acts as a repeater and transmits a scaled version of its observation. 
Like compress-and-forward, this strategy converts the network into a large MIMO channel with the added 
possibility of a beamforming gain. However, noise also builds up with each retransmission. [2], [4], [8]— [10]. 
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In this paper, we propose a new strategy, compute-and-forward, that enables relays to decode linear equations 
of the transmitted messages using the noisy linear combinations provided by the channel. A destination, given 
sufficiently many linear combinations, can solve for its desired messages. Our strategy is based on the use 
of structured codes, particularly nested lattice codes. This ensures that integer combinations of codewords are 
themselves codewords. A relay is free to determine which linear combination to recover, but those closer to the 
channel's fading coefficients are available at higher rates. 

This strategy simultaneously affords protection against noise and the opportunity to exploit interference for 
cooperative gains. Whereas compress-and-forward and amplify-and-forward convert a network into a set of noisy 
linear equations, compute-and-forward can convert it into a set of reliable linear equations. These equations can 
in turn be used for a digital implementation of cooperative schemes that could fit into a (slightly revised) network 
protocol stack. Classical relaying strategies seem to require a cross-layer design that dispenses with bit pipes and 
gives higher layers in the network stack direct access to the wireless medium. However, this would negate many 
of the advantages of a modular design [11]. Compute-and-forward provides a natural solution to this problem by 
permitting a slight revision of the interface from bits to equations of bits. 

We will develop a general framework for compute-and-forward that can be used in any relay network with 
linear channels and additive white Gaussian noise (AWGN). Transmitters send out messages taking values in a 
prime-sized finite field and relays recover linear equations of the messages over the same field, making this an ideal 
physical-layer interface for network coding. We will compare compute-and-forward to classical relaying strategies 
in two network scenarios, one based on distributed MIMO and the other wireless network coding. Classical relaying 
strategies perform well in either low or high signal-to-noise ratio (SNR) regimes. As we will see, compute-and- 
forward offers advantages in moderate SNR regimes where both interference and noise are significant factors. 

A. Related Work 

There is a large body of work on lattice codes and their applications in communications. We cannot do justice 
to all this work here and point the interested reader to an excellent survey by Zamir [12]. The basic insight is 
that, for many AWGN networks of interest, nested lattice codes can approach the performance of standard random 
coding arguments. One key result by Erez and Zamir showed that nested lattice codes (combined with lattice 
decoding) can achieve the capacity of the point-to-point AWGN channel [13]. More generally, Zamir, Shamai, 
and Erez demonstrated how to use nested lattice codes for many classical AWGN multi-terminal problems in [14]. 
Subsequent work by El Gamal, Caire, and Damen showed that nested lattice codes achieve the diversity-multiplexing 
tradeoff of MIMO channels [15]. Note that, in general, structured codes are not sufficient to prove capacity results. 
For instance, group codes cannot approach the capacity of asymmetric discrete memory less channels [16]. 

It is tempting to assume that requiring codes to have a certain algebraic structure diminishes their usefulness 
for proving capacity theorems. However, it has become clear that for certain network communication scenarios, 
structured codes can actually outperform standard random coding arguments [17]. The first example of such behavior 
was found by Korner and Marton in [18]. They considered a decoder that wants to reconstruct the parity of two 
dependent binary sources observed by separate encoders. They found the rate region by using the same linear 
code at each encoder. More recently, we showed that structured codes offer large gains for reliable computation 
over multiple-access channels [19]. Philosof et al. demonstrated that structured codes enable distributed dirty paper 
coding for Gaussian multiple-access channels in [20]. 

The celebrated paper of Ahlswede et al. on network coding showed that for wired networks, relays must send 
out functions of received data, rather than just routing it [21]. Subsequent work showed that linear codes [22], [23] 
and linear codes with random coefficients [24] are sufficient for multicasting. There has recently been a great deal 
of interest in exploiting the physical layer of the wireless medium for network coding [25], [26]. In [27], [28], we 
used structured codes for network coding over finite-field and Gaussian multiple-access networks. In a similar vein, 
Narayanan, Wilson, and Sprintson developed a nested lattice strategy for the two-way relay channel [29]. Nam, 
Chung, and Lee studied the two-way relay channel and Gaussian multiple-access networks with asymmetric power 
constraints in [30], [31]. Giindiiz et al. used structured codes for multicasting over a compound multiple-access 
channel with the help of a relay [32]. 

Work on interference alignment by Maddah-Ali, Motahari, and Khandani [33] and Cadambe and Jafar [34] has 
shown that large gains are possible for interference channels at high SNR. The key is to have users transmit along 
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subspaces chosen such that all interference stacks up in the same dimensions at the receivers. Lattice codes can be 
used to realize these gains at finite SNR. Bresler, Parekh, and Tse used lattice codes to approximate the capacity of 
the many-to-one and one-to-many interference channels to within a constant number of bits [35]. This scheme was 
employed for bursty interference channels in [36]. For symmetric interference channels, Sridharan et al. developed 
a layered lattice strategy in [37]. Structured codes are also useful for ergodic alignment over fast fading interference 
channels [38] and multi-hop networks [39] as well as decentralized processing in cellular networks [40], [41]. 

Distributed source coding can also benefit from the use of structured codes. Krithivasan and Pradhan have 
employed nested lattice codes for distributed compression of linear functions of jointly Gaussian sources [42]. 
More recently, they have developed a framework for discrete memoryless sources using nested group codes [43]. 
Wagner derived an outer bound for the Gaussian case in [44]. 

Large gains are also possible in multi-user source-channel coding [45]-[47]. For Gaussian settings, the modulo- 
lattice modulation scheme of Kochman and Zamir is particularly useful [48]. Finally, recent work by He and Yener 
has shown that lattices are useful for information theoretic secrecy [49]. See also [50]. 

B. Summary of Paper Results 

Our basic strategy is to take messages from a finite field, map them onto lattice points, and transmit (dithered 
versions of) these across the channel. Each relay observes a linear combination of these lattice points and attempts 
to decode an integer combination of them. This equation of lattice points is finally mapped back to a linear equation 
over a finite field. Our main theorems are summarized below: 

• Theorems 1 and 2 give our achievable rates for sending equations over a finite field from transmitters to relays. 
The strategy relies on a nested lattice code which is developed in Theorem 3. 

• Theorems 4, 5, and 6 give sufficient conditions on the equation coefficients so that a destination can recover 
one or more of the original messages. 

• In Theorems 7 and 8 we generalize our compute-and-forward scheme to include successive cancellation and 
superposition coding. 

Finally, we give two applications of compute-and-forward and compare the performance to that of classical 
relaying strategies. In Section VIII, we consider a distributed MIMO problem and, in Section IX, we consider a 
wireless network coding problem. 

II. Problem Statement 

Our relaying strategy is applicable to any configuration of sources, relays, and destinations that are linked through 
linear AWGN channels. We will refer to such configurations as AWGN networks. To simplify the description of 
the scheme, we will first focus on how to deliver equations to a single set of relays. We will then show how 
a destination, given sufficiently many equations, can recover the intended messages. These two components are 
sufficient to completely describe an achievable rate region for any AWGN network. We begin with the necessary 
definitions for L transmitters to send equations to M relays over a wireless channel. Then, we will provide a 
framework for recovering the original messages given a set of equations. 

Remark 1: In fact, one can apply this strategy to non-linear and non-Gaussian channels as well by using the 
modulo-lattice transformation developed by Erez and Zamir in [51]. 

A. Decoding Equations 

We are primarily interested in narrowband wireless channel models so we will specify our encoding and decoding 
schemes for complex baseband and assume that modulation is handled separately. In Section II-C, we will restate 
a few definitions for real-valued channel models so that we can give corollaries that specialize our results to real 
values. 

Let C denote the complex field and F p denote the finite field of size p where p is always assumed to be prime. Let 
+ denote addition over the complex field and © addition over the finite field. Furthermore, let denote summation 
over the complex field and denote summation over the finite field. We also set j = \/— T an d assume that the log 
operation is with respect to base 2. It will be useful to map between the finite field and a corresponding subset of 
the integers. We let g : ¥ p — > {0, 1, 2, . . . ,p— 1} be this one-to-one map. This is essentially an identity map except 
for the change of alphabet. If g or its inverse g^ 1 are applied to a vector we assume they operate element-wise. 
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Fig. 1. L transmitters reliably communicate linear functions to M relays. 
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We will use boldface lowercase letters to denote colu mn vectors a nd boldface uppercase letters to denote matrices. 

For example, h G C L and H G C MxL . Let ||h|| = \JYa=i \ h W denote the ^ 2 -norm of h. Also, let h* and h T 
denote the Hermitian (or conjugate) transpose and the regular transpose of h, respectively. Finally, let denote the 
zero vector, S( denote the unit vector with 1 in the £' h entry and elsewhere, and I MxM denote the identity matrix 
of size M. 

Our main goal is to reliably transmit messages across the network at the highest possible rates. However, 
intermediate nodes in the network may only need to decode a function of the messages. These functions can 
be decoded at high rates if we choose their algebraic structure to match the structure of the channel. Thus, we will 
split our messages over the finite field into two parts: one for the real part of the channel and the other for the 
imaginary part. 

Definition 1 (Messages): Each transmitter (indexed by I = 1, 2, . . . , L) has two length-^ vectors over a prime- 
size finite field, w^,w^ 6 F*'. The superscript denotes whether the vector is intended for the real part or the 
imaginary part of the channel. Together these vectors are the message of transmitter I, = (w^,wj). Without 
loss of generality, we assume that the transmitters are numbered by increasing message length. Since we are 
interested in functions of these messages vectors, we zero-pad them to a common length k = max^^ prior to 
encoding. 

Remark 2: We are interested in average probability of error results so we assume that all messages are drawn 
independently and uniformly from the set of all possible values. 

Definition 2 (Encoders): Each transmitter is equipped with an encoder, Si, 
the finite field to length-n codewords over the complex field: 



that maps length-A; messages over 



S e : F£ x Fj -»• C r ' 



for £ = 1,2, ...,L. 

Definition 3 (Message Rate): The message rate rg of each transmitter is: 

n = — logp 

n 

Definition 4 (Power Constraint): Each transmitter's length-n channel input, 
power constraint: 



(1) 



(2) 



1 



n 



\xif < SNR 



£e(we), is subject to the usual 



(3) 



for SNR > and £= 1,2,... ,L. 

Remark 3: Note that asymmetric power constraints can be easily modeled by scaling the channel coefficients 
appropriately. 

Definition 5 ( Channel Model): Each relay observes a noisy linear combination of the transmitted signals through 
the channel: 

L 



(4) 
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1,2, ...,M where h m £ G C are the channel coefficients and z is i.i.d. circularly symmetric Gaussian 



CM(0,I nxn ). Let h m = [h m i ■ ■ ■ h m L\ T denote the vector of channel coefficients to relay m and let 



for m - 

noise, z ~ uyv ^u, ± ). Lei n m = \n m \ • • • n m L 

H = {h m e} denote the entire channel matrix. Note that by this convention the m th row of H is h^. 

Remark 4: For our initial analysis, we will assume that the channel coefficients are fixed for all time. However, 
these results can easily be extended to the slow fading case under an outage formulation which we develop in 
Section VIII. 

Remark 5: Our coding scheme only requires that each relay knows the channel coefficients from each transmitter 
to itself. Specifically, relay m only needs to know h m . Each transmitter only needs to know its target message rate, 
not the realization of the channel. 

Definition 6 (Decoders): Each relay is equipped with a decoder that maps the observed channel output from the 
complex field back to two equations of messages over the finite field: 



V m : <C n 



F£ xF^ 



(5) 



for m = 1,2,... ,M. 

Our choice of equation structure is inspired by the real-valued decomposition of a complex-valued channel. 
Recall that for any H G C MxL , x G C L , and z € C M , the channel output y = Hx + z can be written as: 



" Re(y) " 




. Im(y) _ 





+ 



Re(z) 
lm(z) 



Re(H) -lm(H) 1 [ Re(x) 
lm(H) Re(H) [ lm(x) 

where Re(-) and Im(-) represent the real and imaginary parts. 

Note that the structure of the equations at an intermediate relay is not important, so long as they are useful for 
conveying messages to the destination at the highest possible rate. 

Definition 7 (Desired Equations): The goal of each relay is to reliably recover a linear combination of the 
transmitted messages. Relay m selects coefficients Q^Q^g £ F p for i = 1, 2, . . . , L and attempts to decode two 
equations: 



u, 



(6) 
(7) 



where (—q m t) denotes the additive inverse of q m g. 

Although our desired equations are evaluated over the finite field ¥ p , the channel operates over the complex 
field C. Our coding scheme will allow us to efficiently exploit the channel for reliable computation if the desired 
equation coefficients are close to the channel coefficients in an appropriate sense. The definition below provides an 
embedding from the finite field to the complex field that will be useful in quantifying this closeness. 

Definition 8 (Coefficient Vector): The equation with coefficient vector a m = [a m \ a m 2 • • • a m j)^ G {Z + jZ} L 
is the linear combination of the transmitted messages with coefficients given by 

= g- 1 ([Re(ew)] mod p) (8) 

iLe = 9" 1 ([Im(ow)] mod p) . (9) 
Definition 9 (Probability of Error): We say that the equations with coefficient vectors ai,a2, . . . ,&m G + 
jZ} L are decoded with average probability of error e if: 

(utuL) = V m (y m ) (10) 



M 



p[ |J {(<&<&) *(»£>0} 



< e. 



(11) 



vm=l 



Definition 10 (Computation Rate): We say that the computation rate R(h, a) is achievable if for any e > 
and n large enough, there exist encoders and decoders, Si,... ,Sl,T>i, . . . ,T>m, such that for any set of channels, 
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hi, ... , hju € C L , and coefficient vectors, ai, . . . , &m S {Z + j'Z} L , all relays can recover their desired equations 
with average probability of error e so long as the underlying message rates n, . . . , tl satisfy: 

re < min R(h m ,a m ) (12) 

m:a m e 5=0 

Remark 6: Note that the above definition means that each relay is free to choose which equation to recover. The 
only constraint is that the message rates be lower than the computation rate for that choice of coefficients. In many 
cases, it will be useful to have each relay recover the equation that is available at the highest computation rate. 

Remark 7: The minimization in (12) is taken over non-zero equation coefficients since only the information in 
these messages is required for the desired equation. 

B. Recovering Messages 

After the relays decode their equations, they forward them towards the appropriate destinations which may only 
be interested in recovering the original messages. If the equation coefficients satisfy appropriate conditions, then the 
destinations can solve for the original messages. Although our scheme can be employed in any AWGN network, 
we will omit formal definitions for such networks and assume that equations of messages arrive at a destination. 
This may occur through a single layer of channels as described above or through multiple layers. 

Definition 11 (Recovery): We say that message G ¥ p e can be recovered at rate m from the equations 
with coefficient vectors ai, . . . ,slm G {Z + jZ} L if for any e > and n large enough, there exists a decoder 
V : {Fp x ¥ k p } M -► F*« x F** such that: 

w^ = P((uf,u(),...,(i4,i4)) (13) 
P (w^ + w t ) < e (14) 

where (u^,u^J represents the equations from relay m as in Definition 7. 

C. Real-Valued Channels 

We will now restate some of the above definitions for real-valued channel models. Each transmitter £ has a 
length-/^ message that takes values over a prime-size finite field, G ¥ p l . As before, we zero-pad all messages to 
a common length k = max; hi prior to encoding. The message rate is given by re = ^ log p. The channel model is 
as in (4) except all inputs and outputs are over the reals instead of the complex field and the noise is i.i.d. Gaussian 
with mean zero and variance one. 

For real-valued channels, the desired linear combination of messages at relay m is: 

L 

u m = 0g m w (15) 

£=1 

for coefficients q m i G F p . These finite field coefficients can be embedded in the reals as well. The equation 
with coefficient vector a m G Z L is the linear combination of the transmitted messages with coefficients given by 
Qmi = [g~ l { a mi)] m od p. The rest of the definitions map in a straightforward fashion to the reals. 

III. Nested Lattice Codes 

In order to allow relays to decode integer combinations of codewords, we need codebooks with a linear structure. 
We will use lattice codes that have both good statistical and good algebraic properties. Erez and Zamir showed 
that nested lattice codes can approach the capacity of point-to-point AWGN channels in [13]. These codes operate 
under a modulo arithmetic that is well-suited for mapping operations over a finite field to the complex field. 

First, we will provide some necessary definitions from [13] on nested lattice codes. Note that all of these 
definitions are given over W 1 . Our scheme will use the same lattice code over the real and imaginary parts of the 
channel input (albeit with different messages). Next, we will describe how to construct nested lattice codes which 
have certain desirable properties at sufficiently high dimension. 
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A. Lattice Definitions 

Definition 12 (Lattice): An n-dimensional lattice, A, is a set of points in W 1 such that if x, y G A, then x+y G A, 
and if x G A, then — x 6 A. A lattice can always be written in terms of a lattice generator matrix L G M nxri : 



A = {x = Lw : w G Z n }. 



(16) 



Note that the origin is always a point in the lattice. 

Definition 13 (Nested Lattices): A lattice A is said to be nested in a lattice Ai if A C Ai. We will sometimes 
refer to A as the coarse lattice and Ai as the fine lattice. More generally, a sequence of lattices A, Ai,. . . , A^ is 
nested if A C Ai C • • • C A^. 

Definition 14 (Quantizer): A lattice quantizer is a map, Qa : W l — > A, that sends a point, x, to the nearest 
lattice point in Euclidean distance: 



Qa(x) = argmin||x - A||. 



(17) 



Definition 15 (Voronoi Region): The fundamental Voronoi region, V, of a lattice, is the set of all points in W l 
that are closest to the zero vector: V = {x : Qa(x) = 0}. Let Vol(V) denote the volume of V. 

Definition 16 (Modulus): Let [x] mod A denote the quantization error of x G W 1 with respect to the lattice A: 



[x] mod A = x — Qa(x) 

For all x, y G R n and A C Ai, the mod A operation satisfies: 

[x + y] mod A = [[x] mod A + y] mod A 

[QAi(x)] mod A = [Qai([x] mod A)] mod A 

[ax] mod A = [a[x] mod A] mod A Va G Z 

/3[x] mod A = [/3x] mod (3 A V/3 G R 

Definition 17 (Nested Lattice Codes): A nested lattice code C is the set of all points of a fine lattice Ai that are 
within the fundamental Voronoi region V of a coarse lattice A: 



(18) 

(19) 
(20) 
(21) 
(22) 



£ = AinV = {x:x = A mod A, A G Ai}. 



The rate of a nested lattice code is: 



1 ^ \r\ 1 ^ Vol ( V ) 
-log £ = - log———. 

n n Vol(Vi) 




(23) 



(24) 



Fig. 2. Part of a nested lattice AcAi C R 2 . Black points are elements of the fine lattice Ai and gray circles are elements of the coarse 
lattice A. The Voronoi regions for the fine and coarse lattice are drawn in black and gray respectively. A nested lattice code is the set of all 
fine lattice points within the Voronoi region of the coarse lattice centered on zero. 



Our scheme relies on mapping messages from a finite field to codewords from a nested lattice code. The relay 
will first decode an integer combination of lattice codewords and then convert this into an equation of the messages. 
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Definition 18 (Lattice Equation): A lattice equation v G C is an integer combination of lattice codewords tg G C 
modulo the coarse lattice: 

L 



v = 



mod A (25) 



for some coefficients ae G Z. 

Let B(r) denote an n-dimensional ball of radius r: 

B(r) = {x : ||x|| < r, x G M n } (26) 

and let Vol(,B(r)) denote its volume. 

Definition 19 (Covering Radius): The covering radius of a lattice A is the smallest real number r cov such that 
R n CA + B(r cm ). 

Definition 20 (Effective Radius): The effective radius of a lattice with Voronoi region V is the real number r EFFEC 
that satisfies Vol(,6(r EFFEC )) = Vol(V). 

Definition 21 (Moments): The second moment of a lattice A is defined as the second moment per dimension of 
a uniform distribution over the fundamental Voronoi region V: 

a\ = [ ||x|| 2 dx. (27) 



nVol(V) Jv 

The normalized second moment of a lattice is given by: 

G < A » " w§w <28) 

The following three definitions are the basis for proving AWGN channel coding theorems using nested lattice 
codes. 

Definition 22 (Covering Goodness): A sequence of lattices A*™) C W l is covering good if: 

r (n) 

lim -£5L = 1. (29) 

1 EFFEC 

Such lattices were shown to exist by Rogers [52]. 

Definition 23 (Quantization Goodness): A sequence of lattices A( n ) C M. n is good for mean-squared error (MSE) 
quantization if: 

lim G(A(™)) = — . (30) 
n-+oo 27re 

Zamir, Feder, and Poltyrev showed that sequences of such lattices exist in [53]. 

Definition 24 (AWGN Goodness): Let z be a length-n random vector with distribution Af(0, a 2 z l nxn ). The volume- 
to-noise ratio of a lattice is given by: 

„ (A ,P e)= (W: (31) 

where <r| is chosen such that Pr{z ^ V} = P e - A sequence of lattices A^ ra ) is AWGN good if 

lim fi(A in \P e ) = 2ire VP e G (0, 1) (32) 

and for fixed volume-to-noise ratio greater than 2ire, the probability of error decays exponentially in n. In [54], 
Poltyrev demonstrated the existence of such lattices. 
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B. Lattice Constructions 

Our nested lattice codes are a slight variant of those used by Erez and Zamir to approach the capacity of a point- 
to-point AWGN channel [13]. As in their considerations, we will have a coarse lattice that is covering, quantization, 
and AWGN good and a fine lattice that is AWGN good. We generalize this construction to include multiple nested 
fine lattices all of which are AWGN good. This will allow each transmitter to operate at a different rate. 

Lemma 1 (Erez-Litsyn-Zamir): There exists a sequence of lattices A( n ) that is simultaneously covering, quanti- 
zation, and AWGN good. 

This is a corollary of their main result which develops lattices that are good in all the above senses as well as for 
packing [55, Theorem 5]. Note that these lattices are built using Construction A which is described below. 

We will use a coarse lattice A of dimension n from Lemma 1 scaled such that its second moment is equal to 
Let L G M nxn denote the generator matrix of this lattice. Our fine lattices are defined using the following 
procedure (the first three steps of which are often referred to as Construction A [55], [56]): 

1) Draw a matrix Gl € F" xfcl - with every element chosen i.i.d. according to the uniform distribution over 
{0,1,2, ... ,p — 1}. Recall that p is prime. 

2) Define the codebook Cl as follows: 

C L = |c = G L w: weFj'}. (33) 

All operations in this step are over F p . 

3) Form the lattice A^ by projecting the codebook into the reals by g(-), scaling down by a factor of p, and 
placing a copy at every integer vector. This tiles the codebook over W 1 : 

A L = p-^(C L ) + Z" (34) 

4) Rotate Ax, by the generator matrix of the coarse nested lattice to get the fine lattice for transmitter L: 

A L = LA L (35) 

5) Repeat steps 1) - 4) for each transmitter £ = 1,2, . . . ,L — lby replacing Gl with Ge which is defined to 
be the first kt columns of Gl- 

Clearly, any pair of fine lattices A^ , A^ 2 , 1 < £\ < £2 < L are nested since all elements of can be found from 
Gg 2 by multiplying by all w G ¥ nxke 2 with zeros in the last £ 2 — £\ elements. Also note that A = LZ ra is nested 
within each fine lattice by construction. Finally, we get the desired set of nested lattices A C Ai C ■ ■ ■ C Al- 

By the union bound, we get that: 



Pr (J {rank(G,) < k e } J < £ £ Pr{G,w = 0} 

V=i / e=i weF j, 

L 

<p- n Y,(P kt ~ l ) ( 36 ) 
e=i 

Thus, so long as p or k\, . . . , Ul grow appropriately with n, all matrices Gi, . . . , Gl are full rank with probability 
that goes to 1. Note that if Gi has full rank, then the number of fine lattice points in the fundamental Voronoi 
region V of the coarse lattice is given by |A^ n V| = p ke so that the rate of A^ is 

i^ = -iog|A*nv| = — io g2 p = \n (37) 

n n 2 

as desired. Furthermore, each message vector w G can be put into one-to-one correspondence with a point in 
A( n V as shown in Lemma 5. In Appendix II, we show that the fine lattices are AWGN good so long as ^ —> as 
n grows. It is clear that one can choose p,k\, . . . ,k,L so that the fine lattices have all the desired properties. One 
possibility is to let p grow like nlogn and set hi = L^-R^(logp) _1 J . 

Lemma 2: Any lattice A that results from Construction A has a full-rank generator matrix L. 
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Proof: Note that Z" C A so that A contains all of the unit vectors by default. Thus, L spans IR n and is full 
rank. ■ 

Remark 8: We require that the fine lattices are generated from full-rank submatrices of the same finite field 
codebook so that it is possible to compute linear equations over messages with different rates. The full rank 
condition on the coarse lattice allows us to move between lattice equations and equations of finite field messages. 

In [55], [57], some useful properties of nested lattices derived from Construction A are established. These apply 
to our construction as well and we repeat them below. 

Lemma 3: Choose any fine lattice A^ from the construction above and let A^(i) denote the i th point in A^ n V 
for % = 0, 1,2, . . . ,p ke - 1. We have that: 

• A.g(i) is uniformly distributed over p~ l k n V. 

• For any i\ / 12, [A^(«i) — A^^)] mod A is uniformly distributed over -{j> _1 A} n V. 

Thus, each fine lattice can be interpreted as a diluted version of a scaled down coarse lattice p _1 A. 

IV. COMPUTE-AND-FORWARD 

We now develop our main result which provides achievable rates for computing linear functions of messages 
across AWGN networks. As we will see, these rates are often far higher than those available either by first recovering 
all messages then computing the function or by standard i.i.d. random coding arguments. 

First, we will state the achievable computation rate and show how to optimally choose the free parameter in the 
rate expression. Next, we will provide some illustrative examples. In Section IV-A, we give a detailed proof of the 
main result. 

Let (x) + =max(x, 0). 

Theorem 1: For channel vectors hi,...,hM G C L , the relays can recover any set of linear equations with 
coefficient vectors ai, . . . , 3lm G {2, + jZ} L so long as the message rates are less than the computation rate 

r e < min (i?(h m ,a m )) + (38) 

m:a m i=t0 

R(h m , a m ) = log ( — — -) (39) 

V|a m | 2 + SNR||a m h m - a m ||V 

for some choice of a m G C. 

Theorem 2: For a given h m G C L , a m G {Z + jZ} L , the computation rate given in Theorem 1 is uniquely 
maximized by choosing a m to be the MMSE coefficient 



SNR h> m 



MMSE l + SNR||h m || 2 

which results in a computation rate 



(40) 




-r 



Proof: Note that the denominator of (39) is quadratic in a m . Thus, it can be uniquely minimized by setting 
the first derivative to zero. 

f(a m ) = a* m a m + SNR(a m h m - a m )*(a m h m - a m ) 

JL = 2 a m + SNR(2a m h> m - 2h^a m ) = (42) 

da m 

a m (2 + 2SNR||h m || 2 ) = 2SNR h> m (43) 
We solve this to get a MMSE and plug back into f(a m ). 

SNR 2 |h^a m | 2 SNR 3 ||h m || 2 |h; t a m | 2 

/V^MMSE/ , no\2~'~ 



(l + SNR||h m || 2 ) 2 (l + SNR||h m || 2 ) 2 
SNR 2 lhIa^l 2 ..o 
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Substituting this into log 



SNR 



yields the desired computation rate. 



, /( q mmse) / 

The above two theorems can be interpreted in the following sense. Each transmitter uses a fixed (lattice) codebook 
with a specified rate. By setting a m = 1 and decoding to the closest lattice point (in Euclidean distance), the relays 
could recover any linear equation so long as the message rates are less than 

l0g ( l + SNRBh„ R -a m ||0 (46) 



However, if each relay scales its channel output by the appropriate a prior to decoding, then higher rates are 
possible. This is because the channel output a m y m = a m h, m p^i + a m z m can be equivalently written as a 
channel output y m = ^mi^-i + z m where h m £ = a m h m e and z m is i.i.d. circularly symmetric Gaussian noise 



with variance \a m \ . Since there is a rate penalty both for noise and for non-integer channel coefficients, then a 
should be used to optimally balance between the two as in Theorem 2. This is quite similar to the role of the 
MMSE scaling coefficient used by Erez and Zamir to achieve the capacity of the point-to-point AWGN channel in 
[13]. 



4 
3.5 

3 



£2.5 



o 1.5 

O 



1 

0.5 





9 o p 



10 15 
Coefficient Vectors 



T ? T T 9 9 ^ 



20 



25 



Fig. 3. Computation rate at SNR = 5dB for different coefficient vectors a 6 {Z + JZ} 2 for channel 

h= [-1.1744 + j2. 1496 1.2512 - jl.6335] T . The coefficient vector with the highest rate is a = [1 - 1] T with R = 3.5436. 
Using standard multiple-access codes, this equation can only be computed at rate _Rmac = 2.5301. 



We will often be interested in finding the coefficient vectors with the highest computation rates. This does not 
require a search over all integer vectors as most vectors trivially give zero rate. 

Lemma 4: For a given channel vector h G C L , the computation rate from Theorem 2 is zero if the coefficient 
vector a G {Z + jZ} L satisfies: 

llall 2 > 1 + ||h|| 2 SNR. 



Proof: Note that |h^a m | 2 < ||h, 
the achievable rate from Theorem 2: 

log 

= log 
< log 

The result follows immediately. 



,2 || 
m|| || a n 



(47) 

by the Cauchy-Schwarz inequality. Using this, we can upper bound 




SNR \h* m a m \ 2 
l + SNR||h m || 2 



1 + SNRIIK 



(48) 



HaJI 2 + SNR||h m ||2||a m p - SNR |h^a m |2 
1 + SNR||h m || 2 



| a m| 



(49) 
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In Figure 3, we have plotted the computation rates for coefficient vectors with non-zero entries for an example 
channel of length 2. The vectors are sorted by descending rate and we only include one of the four possible rotations 
of each vector, a, — a, ja, and — ja, as these yield identical rates. 

Remark 9: Note that each relay is free to decode more than one equation, so long as all the appropriate 
computation rates are satisfied. In some cases, it may be beneficial to recover a desired equation by first decoding 
equations of subsets of messages and then combining them. 

Example 1: Let the channel matrix take values on the complex integers, H G {L + jZ} MxL , and assume that 
each relay wants a linear equation with a coefficient vector that corresponds exactly to the channel coefficients, 
a m = h m . Using Theorem 2 the achievable computation rate is: 

^logllll^f- ,^^ )"') < 50 > 
l + SNRIIM 2 , (51) 




h„,||2+ SNRHhJIl-SNRHhJIl 



= 1 ° g (pV SNR ) <52) 
Remark 10: One interesting special case of Example 1 is computing the sum of codewords wi © W2 over a 

two-user Gaussian multiple-access channel y = xi + X2 + z. To date, the best known achievable rate for this 

scenario is log Q + SNR). Several papers (including our own) have studied this special case and it is an open 

problem as to whether the best known outer bound log(l + SNR) is achievable [28]-[30]. Clearly, one can do 

better in the low SNR regime using standard multiple-access codes to recover all the messages then compute the 

sum to get ±log(l + 2SNR). 

Example 2: Assume there are L transmitters and L relays. Receiver m wants to recover the message from 

transmitter m. This corresponds to setting the desired coefficient vector to be a unit vector a m = [0 • • • 1 • • • 0] T 

where the m th element is 1 and the rest are 0. Substituting this choice of a m into Theorem 2, we get that the 

messages can be decoded if their rates satisfy: 

. ((. SNR|fr mm | 2 
rm<log il 1 " l + SNR||M 2 J ) (53) 

lQg " 1 + SNRHM 2 J ' ( ) 

/ SNR|/w| 2 \ 
= l0 H 1+ l + SNR||h m f j (55) 

This is exactly the rate achievable with standard multiple-access techniques if the relays ignore all other messages 
as noise. In Section VI, we will use successive cancellation of lattice equations to show that if a relay wants all of 
the messages, any point in the Gaussian multiple-access rate region is achievable with compute-and-forward. 

The following example shows that it is useful to allow for a different rate at each transmitter. 

Example 3: We have L = 4 transmitters and M = 2 relays. The channel vectors are hi = [4 - 4 1 - If and 
h.2 = [1 1 2 2] T . The desired coefficient vectors are ai = hi and a2 = [0 1 1]. These equations can be 
reliably recovered so long as the message rates satisfy: 

flog (i + SNR) 1=1,2 

A. Proof of Theorem 1 

We now provide a detailed description of our encoding and decoding scheme. The following four steps are a 
basic outline: 

1) Each transmitter maps its message from the finite field onto an element of a nested lattice code. 

2) Lattice codewords are dithered and transmitted over the channel. 
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3) Receivers reliably decode a linear equation of the lattice codewords. 

4) These lattice equations are mapped back to the finite field to get the desired linear combination of messages. 
We will first show how to go from messages to lattice codewords and from lattice equations back to linear 

combinations of messages. Then, we prove that linear equations of lattice codewords can be recovered at the 
desired computation rates. Finally, we will combine all of these steps to get our compute-and-forward scheme. 

The following lemma provides a mapping from messages over the finite field to points in a nested lattice code. 

Lemma 5: The function fa : F^' — ► Ac n V is one-to-one where: 

fa(w) = [LjTVG/w)] mod A. (57) 
Proof: Since G^ is assumed to be full rank, it takes w to a unique point in Q. Thus, p _1 g (Ggw) maps w 
to a unique point in [0, l) n . Lemma 2 shows that L is full rank so we just need show that the mod A operation is 
a bijection between L[0, l) n and V. Assume, for the sake of a contradiction, 3x,y G L[0, l) n ,x / y such that [x] 
mod A = [y] mod A. This implies that: 

x - Qa(x) = y - Q\{y) 
[L^Or - Qa(x))] mod I n = [L" 1 ^ - Q A (y))\ mod I n 
[L^x] mod I n = \L~ l y] mod I n 

x = y 



where the third step follows since for any A G A, L _1 A G I n . A contradiction has been reached which, combined 
with the fact that |Fp £ | = |A^ n V| = p ke , shows that fa is a one-to-one map. ■ 

Assume the messages are encoded onto nested lattice codes using fa from Lemma 5 and that the relays 
successfully recover a linear combination of lattice points. The following lemma shows how to convert these 
lattice equations back to the desired messages equations. 

Lemma 6: For messages G ¥ ki zero-padded to length k m , let u = ^ qe^e for some coefficients q£ G ¥ p . 
Also, set te = fa(w£) and v = a(t(\ mod A for some ai G Z such that qi = g~ 1 ([ae] mod p). We have that 
u = (f)~n ( v) where 

^(v) = (GlG m )- l Glg- 1 (p\L-\] mod Z n ) . (58) 
Proof: Recall that since L is the generator matrix of A, L _1 A = Z n . Also note that since W£ is zero-padded 
to length k m , then multiplying by G m has the same effect as multiplying the original message by Ge- We have 
that: 



[L _1 v] mod I r ' 



L 1 ^a <! t <; + L 1 Q A l^aett 



mod II- 



mod I n 



e=i 

L 

^2 ae (p _1 ff(G m w«) - L~ 1 Q A (Lp~ 1 g(G m w e ))) 

L 

^ a(.p~ 1 g{G m W() mod T l 
I=i 



mod I n 



(59) 
(60) 

(61) 

(62) 

(63) 
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Multiplying by p we get: 



p[L _1 v] mod U' 



^2 aeg(G m w e ) 
U=i 

9 ^(J) qeG m w e 



= 9 ^© QeGmW^J 




(64) 
(65) 
(66) 



Applying g 1 to move back to the finite field we get: 



g- 1 (^L-V] mod Z n ) = G m (67) 

e=i 

Finally, note that (G^G m ) 1 G^ is the left-inverse of G m and we get that (v) = u. ■ 
The proofs for our lattice coding scheme will make use of random dither vectors which are available at the 

transmitters and relays as common randomness. Since our scheme works with respect to expectation over these 

vectors, then it is clear that (at least) one set of good fixed dither vectors exists. See Appendix III for more details. 

The following lemma from [13] captures a key property of dithered nested lattice codes. 

Lemma 7 (Erez-Zamir): Let t be a random vector taking values on M. n . If d is independent of t and uniformly 

distributed over V, then [t — d] mod A is also independent of t and uniformly distributed over V. 

Theorem 3: For any e > and n large enough, there exist nested lattice codes A C Ai C • • • C Al with rates 

Ri, . . . , Rl, such that for all channel vectors hi, ... , € C L and coefficient vectors ai, . . . , &M € {1 + j^} L , 

each relay can decode lattice equations v^j, where 



v 



R 



Re(a m £)tf - \m(a mi )tj 

L 

Y lm(ow)tf + Re(a m£ )t| 
£=l 



mod A 



mod A 



(68) 
(69) 

(70) 
(71) 



of transmitted lattice points tf,tg £ A^nV with total probability of error e so long as: 

R e < min (ii(h m ,a m )) + 

> 1 ( SNR 

i?(h m ,a m ) = - log | „ 

2 VFml 2 + SNR||a m h m - a m || 2 

for some choice of a m G C. 

Proof: Each encoder is given two dither vectors, and d|, which are independently drawn according to 
a uniform distribution over V. All dither vectors are made available to each relay. Encoder £ generates a channel 
input: 

= [tf - df ] mod A + j[t\ - &\] mod A. (72) 

By Lemma 7, the real and imaginary parts of are independent and uniform over V so ^.E[||x^|| 2 ] = SNR, with 
expectation taken over the dithers. In Appendix III, we argue that there exist fixed dithers that meet the power 
constraint set forth in (3). 

The channel output at relay m is: 

L 

y m = ^2 h m£ x e + z m . (73) 
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Let £(m) = max{^ : a m £ / 0} and let Q rn denote the lattice quantizer for A^ m y Note that £(m) is the highest 
rate message in the equation and thus the rate of the equation that relay m wants to decode. Each relay computes: 

L 

= Re(a m y m ) + Re(a rne )df - \m{a ml )d{ (74) 



l=\ 

L 



4n = lm(a m y m ) + ^ lm(a m ^)df + Re{a me )d\. 



(75) 



To get the estimates of the lattice equations, these vectors are quantized onto k^t m \ modulo the coarse lattice A: 

v m = [Qm(s£)] mod A (76) 

V m = [Qm(0] mod A. (77) 

Note that by (20) we have: 

[Q m (s£)] mod A = [Q m ([s m ] mod A)] mod A. (78) 
We now show that [s^] mod A is equivalent to plus some noise terms. 



[s m ] mod A 

L 

^ Re(a m h m e)Re(x.e) - lm(a m /i m £)lm(x£) + Re(a m £)df - \m(a m i)d\ + Re(a m z r 



mod A 



^2 Re(a m ^)(Re(x £ ) + df) - lm(a„^)(lm(x^) + d e ) + Re ((a m h m£ - a m e)x e + a m z m ) 

L 

Re(a m£ )([tf - df] mod A + df) - \m(a ml ){[t\ - d{] mod A + dj) ■■■ 



mod A 



a m z, 







mod A 



••• + Re(^(a m h m e - a m e)xe 

L 

^2 Re(a m e)tf - \m(a mi )tl + Re((a m h m e - a m i)x e + a m z m ) 
l=\ 

L 

v m + Re(a m z m ) + ^ Re(a m h me - a ml )[tf - df] mod A - \m{a rn h ml - a rni )[t\ - d\] mod A 



mod A 



(79) 
(80) 
mod A 



Using similar manipulations, it can be shown that [s m ] mod A is equivalent to plus some noise terms as well. 
Using Lemma 7, we can show that the channels from to and from to are equivalent in distribution 
to: 

(81) 
(82) 

(83) 

(84) 

(85) 
(86) 



m 

v 1 

TO 


= [Qm(v£ + z*J] mod A 
= [<2m« + z^J] mod A 




7 R 

a eq,m 


L 

= Re(a m z m ) + J2 9 %tdf - 
i=\ 




a eq,m 


L 

= lm(a m z m ) + ^0^df + 

£=l 

= Re(am^m£ - «w) 
= \m(a m h rn e - a m i) 


Q R A 1 


aR 
6ml 
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where each and dj, is drawn independently and uniformly from V. 

Using Lemma 8 from Appendix I, we have that the densities of both and z 1 m are upper bounded (times 

a constant) by the density of an i.i.d. zero-mean Gaussian vector z* m whose variance cr^ approaches 

^ m = ^ + ^((C) 2 + (C) 2 ) (87) 

Wm\ 2 SNR 2 

= — 2 h — ^-||a m h m - a m || z (88) 

as n — > oo. We also show in Appendix II that Ai, A2, . . . , Ax, are AWGN good. From Definition 24, this means that 
Pr(z^ ^ V^( m )) goes to zero exponentially in n so long as the volume-to-noise ratio satisfies /x(A^ m ),P e ) > 2-7re. 
Since decoding is only in error if or Zg ? m leave the fundamental Voronoi region of the fine lattice A^ m ) and 
the probability of this event is upper bounded by Pr(z^ ^ VW m )) times a constant, we get that Pr(v^ / v^) and 
Pr(v^ / v^J go to zero exponentially in n. 

To ensure that the probability of error goes to zero for all desired equations, we get that the volume of V^ m ) 
must satisfy 

27re < ^(A|. (m) ,P e ) = ^ (89) 

for all relays with a m £ / 0. If we set the volume of Ve as follows, the constraints are always met: 

( x n/2 

Vol(V^) > 2vre max a 2 m (90) 

V m:a me ^0 J 



Recall that the rate of a nested lattice code is 



1. / Vol(V) \ 



R '-v^{v<m)- (91) 



Using (28), we can solve for the volume of the fundamental Voronoi region of the coarse lattice: 

SNR/: 

G(A) 



VoKVH«r (92) 



It follows that we can achieve any rate less than: 



1 / SNR/2 



m:a m i 



^o2 ° yr;.;.\)2-f l 



R e < min - log — — 1 . (93) 



while driving the probability of error to zero. Choose 5 > 0. Since A is good for quantization, for n large enough, 
we have that G(A)27re < (1 + 5). We also know that cr^ converges to N eq ^ m so for n large enough we have 
<7^ < (1 + 5)N eq:Tn . Finally, we get that the rate of each nested lattice code is at least: 

!, ( SNR \ , lA 

mm - log ^5 p .,-|. : TTcj - log(l + d) 

m:o»</o 2 y \a m \ z + SNR||a m h m - a m ||^/ 

Thus, by choosing <5 small enough, we can approach the computation rates as closely as desired. ■ 

Remark 11: Note that (71) differs from (39) by a factor of \. This is because (71) is the rate of the nested lattice 
code which operates in the reals and (39) is the rate of the message which has a real and an imaginary component. 

Remark 12: The proof of Theorem 3 only requires channel state information at the relays. In particular, each 
relay only needs to know the channel coefficients from the transmitters to itself h m . 

We now put all of these ingredients together to prove Theorem 1 . See Figures 4 and 5 for block diagrams of the 
encoding and decoding process. 

Encoder I maps its finite field message vectors, w^,w| G ~¥ k p l , to lattice points, tf,t\ 6 A; fl V, using <^(-) 
from Lemma 5. 

tf = &(w?) (94) 
t£ = <Mwf) (95) 
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Fig. 4. Block diagram of the compute-and-forward encoder for transmitter I, £i. Messages from a finite field are mapped onto a nested 
lattice code, dithered, and transmitted across the channel. 




Remove 
Dithers 



[Q m ( )]mod A 










4>m 



R 



Remove 
Dithers 



[Q m ( )]mod A 






dr 1 




rm 



'Ut, 



Fig. 5. Block diagram of the compute-and-forward decoder for relay m, T> m , The channel observation is scaled and decomposed into its 
real and imaginary components. The decoder then removes the dithers, quantizes onto the appropriate fine lattice, and takes the modulus 
over the coarse lattice. This results in an equation of lattice codewords which is then mapped into an equation of messages over the finite 
field. 



Using Theorem 3, these lattice points are transmitted across the channel. Since the message rates are twice the 
lattice code rates, r£ = 2R^, the relays can recover lattice equations 

L 

^2 Re(a m£ )tf - \m{a ml )t\ mod A 



v 



R 



L 



^2 lm (°w)tf + Re(a m£ )t^ 



mod A 



(96) 
(97) 



for any ai , . . . , slm G + jZ} L so long as 



Tf < min ( log 



SNR 



+ 



\a m \ 2 + SNR||a m h m - a m || 2 , 

for some a\, . . . , au £ C. Finally, using from Lemma 6, each relay can decode the desired linear combination 
of messages: 



u 



m rm 



U 



to rm 



1 (v^) = ©g^wf e(-^)w| 

L 



(98) 
(99) 



e=i 



B. Real-Valued Channels 

We now restate our main results for real-valued channels. 
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Corollary 1: For real-valued channel vectors hi, ... , \im € R L , the relays can recover any set of linear equations 
with coefficient vectors ai, . . . , &m € Z L so long as the message rates are less than the computation rate 

n < min (i?(h m ,a m )) + (100) 

m:o m i7=0 

R(h m , a m ) =\ log ( ^ CMD ,f NR , ^) (101) 

2 V«4 + SNR||a m h m -a m || 2 y 

for some choice of a m € R. 

The coding scheme is quite similar to that for complex-valued channels: encoder £ maps its finite field message 
vector, £ Fp £ , to a lattice point, t< 6 A; n V, using <pi{-) from Lemma 5. 

t, = <Mw,) (102) 

It follows from the proof of Theorem 3 that if these lattice points are transmitted across the channel then each 
relay can recover a lattice equation 

" L 

.£=1 

for any ai , . . . , &m S Z l so long as the message rates satisfy 

. 1/, / SNR xx + 

< min — I log I 

mW0 2\ V.«m + SNR ll«m h m - a m || 2 , 

for some «i, . . . , «m S R. Finally, using 4>~} from Lemma 6, each relay can decode the desired linear combination 
of messages: 

L 

Urn = 0m (O = (104) 

Corollary 2: For a given h m G R L , a m G Z L , the computation rate given in Corollary 1 is uniquely maximized 
by choosing a m to be the MMSE coefficient 



mod A (103) 



SNR hla 

n/ = — 

"MMSE 

which results in a computation rate 



"mmse - 1 + SNR||hJ| 2 K } 



2 fa I V l + SNR||h m || 2 



C. Multi-Stage Networks 

The framework developed in this section can easily be applied to AWGN networks with more than one layer of 
relays. Once the first layer has recovered its equations, it can just treat them as a set of messages for the second 
layer. The second layer simply decodes equations with coefficients that are close to the channel coefficients. This 
process repeats until the equations reach a destination. Since these layered equations are all linear, they can be 
expressed as linear equations over the original messages. 



V. Recovering Messages 

The primary goal of compute-and-forward is to enable higher achievable rates across an AWGN network. Relays 
decode linear equations of transmitted messages and pass them towards the destination nodes which, upon receiving 
enough equations, attempt to solve for their desired messages. In this section, we give sufficient conditions for 
recovering messages from a given set of equations. 
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It will be useful to represent the equations in matrix form. Let Q R = {q^e} and Q 1 = {q^e}- Now let Q denote 
the coefficient matrix where 

-Q 7 
Q 1 Q R 



Q 



(106) 



It is clear that we can write the linear combinations of messages and u^ t as 



uf 




wf 


u{ 


= Q 


w{ 






. W M . 



(107) 



Using this representation, we can easily obtain the following two theorems for recovering messages. 
Theorem 4: Given L linear combinations of messages with coefficient matrix Q G ¥ 2Lx2L , a destination can 
recover all messages if Q is full rank over F p . 

The proof follows by noting that if Q is full rank, the destination can simply apply the inverse matrix to the vector 
of equations in (107) to recover the original messages. 

Recall that 5g is the unit vector with 1 in the £ th entry and elsewhere. 

Theorem 5: Given K linear combinations of messages with coefficient matrix Q G ¥ 2Kx2L , a destination can 
recover the message from encoder I if there exists a 2 x 2K matrix $ such that 

5J 



°t+M 



(108) 



Again, it is clear from (107) that applying $ to the vector of equations allows the destination to recover vjf and 
since they occupy positions t and t + M in the vector of messages. 
Remark 13: These conditions can also be easily stated in terms of the coefficient vectors ai, . . . , slm £ {Z+jZ} L . 
First set A = [ai • • • slm} T and let A denote its real-valued decomposition: 



Re(A) 
lm(A) 



-lm(A) 
Re(A) 



(109) 



Now one can replace Q with A in Theorems 4 and 5 so long as all operations are taken modulo p. 

It may be more convenient to evaluate the rank of the coefficients directly on the complex field. This is possible, 
given some assumptions on the equation coefficients. 

Theorem 6: The destination is given L linear equations with coefficient vectors ai, . . . ,sll € ¥ 2 l Lx2L . Assume 
that magnitude of each equation coefficient is upper bounded by a constant, \a m (\ < omax • Then, there exists an 
no such that for all blocklengths n > no, the destination can recover all messages if A = [ai • • • aiJ T is full rank 
over the complex field. 

Proof: A is full rank over the complex field if and only if its real-valued decomposition A from (109) is full 
rank over the reals. Recall that a matrix is full rank only if its determinant is non-zero. We will now show that for 
sufficiently large p, if the determinant of A is non-zero over the reals it is non-zero modulo p. The determinant 
over R can be written as: 



2L 



det(A) = ^sgn(a) J] 



(110) 



a-es 



m=l 



where S is the set of all permutations of {1, 2, ... , 2L}, sgn(o-) is the signature of the permutation which is equal 
to 1 if it is an even permutation and — 1 if it is an odd permutation, and a m £ are the entries of A. Using the bound 
on a m e and the fact that |<S| = (2L)!, the determinant is lower and upper bounded as follows: 

-(2L)!(a MA x) 2L < det(A) < (2L)!(a MA x) 2L (111) 



SUBMITTED TO IEEE TRANS INFO THEORY, AUGUST 2009. 



20 



The determinant under modulo p arithmetic can be written as: 

2L 

sgn^) n 



.aeS 



m=l 



mod p 



(112) 



Since the underlying field size p — > oo as n — > oo, for large enough blocklength n, we can use the bounds on 
det(A) to show that the determinant modulo p does not wrap around zero. This immediately implies that it is zero 
if and only the determinant is zero over the reals. ■ 
Remark 14: Theorem 6 can also be stated in terms of bounds on the channel coefficients. For instance, if 
\h m (\ < /imax, then we can use the bound in Lemma 4, to show that \a m (\ is bounded as well. More generally, 
the result holds if the channel coefficients are drawn from a distribution such that Pr (U m £{|/i m ^| > h M Ax}) — ► 
as tiMAx — ► oo. In this case, we choose h M AX such that this probability is very small and can be absorbed into the 
total probability of error for our scheme. The result follows by taking an appropriate increasing sequence of h MAX . 



Wi ■ 



W2 - 




x m A, yti 



Wi 
W2 



Fig. 6. A linear relay network where compute-and-forward is beneficial. 



Example 4: Consider the AWGN network in Figure 6. Encoder Si,... ,£m send messages through a channel 
H to relays TZ\, . . . ,1Zm- Each relay has a point-to-point AWGN channel to the final decoder V which wants to 
recover all of the messages at the highest possible symmetric rate. Each channel input has power at most SNR and 
all noise terms are i.i.d. circularly symmetric Gaussian with variance 1. Let H be an M x M Hadamard matrix. 
(We assume that M is chosen such that a Hadamard matrix of that size exists. ) Recall that a Hadamard matrix 
has ±1 entires such that HH T = Ml. 

Using Theorems 2 and 6 and setting the coefficient vectors equal to the channel vectors, a m = h m , compute- 
and-forward can achieve 



(113) 



bits per channel use per user since H is full rank. It can be shown that decode-and-forward, amplify-and-forward, 
and compress-and-forward (with i.i.d. Gaussian codebooks) can achieve 



/2 nF = -llog(l + MSNR) 



i? AF = R cf = log 1 + SNR 



SNR 



("MSNR + 1 



(114) 
(115) 



bits per channel use per user. Compute-and-forward is the dominant strategy except at very low SNR and it 
approaches the upper bound i? UPPER = log (1 + SNR) as SNR — > oo. As M increases the rates of decode-and- 
forward, amplify-and-forward, and compress-and-forward go to 0. 

Example 5: There are 3 destinations which each want to decode the message associated with their index. Assume 
that the underlying field size is p = 3. The first destination receives equations with coefficient vectors ai = [4 2 2] T 
and a 2 = [5j — 1 — 1] T , the second receives bi = [1 2 1] T and b 2 = [-1 — 1] T , and the third receives 
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c = [0 1 + j] T . The finite field matrix representation of these equations is: 





" l 




2 
2 


2 " 

2 


Qi = 


' 

2 






" 




Qf = 


" l 

2 


2 



1 " 

2 


Q£ = 


" 







" 




Qc = 









Qi = 


[o 








Using Theorem 5 with the following matrices, each destination can recover its desired message: 



2 
1 

2 
1 



(116) 

(117) 
(118) 

(119) 
(120) 



VI. Successive Cancellation 



Once a relay has recovered an equation of messages, it can subtract it from its channel observation. This results 
in a residual channel output from which it can extract a different equation, potentially with a higher rate than 
possible over the original channel. One key difference from standard applications of successive cancellation is that 
the relay cannot completely cancel out all channel inputs associated with the decoded equation. This is because in 
the first step, it only decodes an integer combination of the messages, which is often not the same as the linear 
combination taken by the channel. 

We demonstrate an achievable region for decoding two different equations using successive cancellation at each 
relay. This can be easily generalized to more than two equations. 

Theorem 7: For channel vectors hi,. . . ,h.M € <C L , the relays can first decode any set of linear equations with 
coefficient vectors ai, . . . , slm € {Z + jZ} L and then any set with coefficient vectors bi, . . . ,\)m £ + j^} L so 
long as the message rates are less than the computation rates: 



re < min min i?i(h m ,a m ), min R 2 (h m , a m , b r 

SNR 




I Q-m | ^ S N R 1 1 (Xm hm &m 1 1 ^ 

Rbi, a m = Si for some i, 
Rb2, otherwise. 

SNR \ 



|/3 m | 2 + SNRE^i \(3 m h mt - b rne \ 2 J 
SNR 

\(3 m \ 2 + SNR||/3 m h m - r m a m - b m || 2 



for some choice of a m , j3 m £ C and r m eZ| jZ. 

Proof: All messages are mapped on to lattice points, dithered, and transmitted across the channel as in the 
proof of Theorem 1 . The first set of equations can be reliably decoded using the procedure from Theorem 1 as well. 
Now, we condition on the event that each relay has successfully recovered the equation with coefficient vectors 

Consider the case where the first coefficient vector at relay m is a unit vector a m = Si. This means that relay m 
can successfully decode the message (wf ,w[) from encoder i. It can then replicate the encoding process to get 
Xj. Now, the relay computes 

y m - hmi^i = ^ h rn e^E + Zm (121) 
t+i 
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and uses this as a channel output for Theorem 1 to get the equation with coefficient vector b m which is equal to 
b m except that it has in the i th position. It then adds and w 7 to the recovered equation to get h m . 
If a m is not a unit vector, the decoder has access to the lattice equations: 

" L 

Re(ow)tf - \m(a me )t{ mod A 



v m 



^2 lm(a m ^)tf + Re(a m£ )t 7 



mod A 



(122) 
(123) 



From which it computes 



v 7 



v m ~ ( £ Re ( a «^) d ? - lm(<w)df 
\e=i j 

L 

y Re(a m< ?x <; ) mod A 
v ™ ~ (j2 lm ( a ™^) d f + Re(a m£ )d| 

L 

^2 \m{a m ixi) 



mod A 



(124) 



mod A 



1=1 



mod A 



(125) 



y% = [Re(p m y m ) - Re(r m )v^ + lm(r m )v^] mod A 



Re I ^ ((3 m h m e — T m a m £)it£ + z r , 



mod A 



[lm(/? m y m ) - lm(r m )v* - Re(r m )v m ] mod A 
lm ^2 (Prnhmi - T m a m e)x e + mod A 

Now we can follow the steps in the proof of Theorem 3. In (74), replace Re(a m y m ) with y„ and in (75), replace 
lm(a m y m ) with y^. In all steps of the proof, we substitute a ml with b mt , a m h ml with f3 m h mt - T m a mt , and, if 
has not already been replaced, a m with (3 m . ■ 
Remark 15: Given, a m , b m , and r m , we can solve for the optimal (3 m following the steps of the proof of 
Theorem 2. 

Example 6: There are L = 4 transmitters and M = 1 relay and the channel vector is hi = [10 10 8j 8j] T . The 
relay wants to first decode the equation with coefficient vector ai = [1 1 j j] T and then with coefficient vector 
bi = [1 I — j — j] T . Using Theorem 7, this is possible if the message rates satisfy: 



, ,1 81SNR \ 
^ <mm(l0g( 4 + TTISNR 



)' l0g (4 + SNR )) 



by using n = 9 so that hi — nai = bi. Note that if we applied Theorem 1 to decode bi, we would not be able 
to get a positive rate. 

Remark 16: As noted in Remark 9, it may be more efficient to recover an equation piecewise by recovering 
equations of subsets of messages and taking an appropriate linear combination of these equations. Theorem 7 is 
strictly better for this process than Theorem 1. 

Assume there is only one relay and that it wants to recover all transmitted messages. This is the standard Gaussian 
multiple-access problem whose capacity region is well-known to be the set of all rate tuples (n, . . . , 7\l) satisfying: 



J2 r ? < logfl + SNR^IM 2 ) 



(126) 
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for all subsets S C {1,2,...,L} [58, Theorem 14.3.5]. We now show that compute-and-forward includes the 
multiple-access capacity region as a special case. First, we consider the corner point of the capacity region associated 
with decoding the messages in ascending order. From Example 2, this is possible if: 

r,<hjl + ' ft "'T V (127, 

Using Theorem 7, the relay removes xi from the channel observation to get J2e=2 hu~x-i + zi- It then repeats the 
above procedure for each message in ascending order to get: 

r( <log(l + ^5 ) (128) 

This is clearly a corner point of the multiple-access capacity region. By changing the decoding order, any corner 
point is achievable. Note that any point on the boundary of the capacity region is achievable by time-sharing 
between two corner points. 

VII. Superposition 

In the previous section, we considered the scenario where each relay decodes several equations, but the transmitters 
each use a single codebook (as in Theorem 1). However, when decoding multiple equations, it is sometimes useful 
to superimpose multiple codebooks. We investigate this possibility in this section. 

We will assume that there are two levels A and B and that each relay wants to a recover an equation from both 
levels. (If it is not interested in a level, it can just set its desired coefficients to zero.) 

Each encoder has two messages w<4 = (wf A , wf A ) and mv£b = ( w Ib> w 1b) with rates r iA and res respectively. 
Each relay wants to decode equations u^ A , and u^ B , vJ mB with coefficient vectors a m and b m respectively. In 
the theorem below, we give achievable rates for this scenario by combining superposition and successive cancellation. 
The basic idea is to superimpose two lattice codes at each receiver (in both the real and imaginary dimensions) 
scaled by 7^4 and 7^ to ensure that the power constraint is met. 

Theorem 8: Choose jeA,l£B suc h that \ jeA\ 2 + \iib\ 2 = 1- For channel vectors hi, . . . , h.M G C L , the relays 
can first decode any set of linear equations over wg A with coefficient vectors ai, . . . , &m G + j'Z} L and then 
any set of linear equations over ~W£b with coefficient vectors bi, . . . ,i>M € {Z + j'Z} L so long as the message 
rates are less than the computation rates: 

h m A = [jlAhml ■ ■ ■ r )LAh m L\ T 
h m B = YilBhml ■ ■ ■ jLBh m L] T 

r eA < min (i?i(h mj4 , h rnB , a m )) + 
i?i(h m A,h mB ,a m ) = log 



|a m | 2 (l + SNR||h mB || 2 ) + SNR\\a m h mA - a, " 2 



m I 



r (B < min (R 2 (h mA ,h m B,a m ,b m )y 

m:b me ^£0 



R2(h m A, h m B, a m , b m ) — 




m = Si for some i. 




SNR 



m | 2 (l + SNR hiAh me \ 2 ) + SNR||/3 m h mB - b m || 2 ^ 

SNR 



p m \ 2 + SNR||/3 m h mA - r m a m || 2 + SNR||/3 m h mB - b m || 2 7 

for some choice of a m , [3 m € C and r m eZ| jZ. 

Proof: Choose a set of nested lattices A, Ai^, . . . , Ala, Aib, • • • ,&lb with appropriate rates. Each encoder 
maps its messages onto lattices and dithers them with df A , d^, df B , d^ B generated independently and uniformly 
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from V: 



w 



I A 



% = [tf A - df A ] mod A + j[t^ - d£j mod A 
*ib = [t? B - df B ] mod A + j[t\ B - dj~ B ] mod A 
It then combines x^^ and x^b according to 7^4 and 7^3 which guarantees the power constraint is met: 

X £ = 7 M X M + -f£ B X£B 



n 



^[||x,|| 2 ] = (| 7 m| 2 + |7£b| 2 )SNR = SNR 



(129) 
(130) 
(131) 
(132) 

(133) 
(134) 



At each receiver, we can just treat the channel output as if it came from 2L transmitters labelled IA, . . . , LA, IB, . . . , LB. 
We can write the channel to receiver m and the desired coefficient vectors as: 



h m = 







b m = 







(135) 



We can now directly apply Theorem 7 with h m , a m , and b m to get the desired result. ■ 
Remark 17: In order to focus on concepts and keep notation manageable, we have chosen to present our result 
as above in Theorem 8. There are several immediate extensions, including: 

• More than two levels. 

• Allowing a different decoding order at each relay. 

• Equations spanning different levels. 

Example 7: There are L = 3 transmitters and M = 1 relay and the channel vector is hi = [1 1 V2] T . Levels A 
and B that use scaling coefficients 71,4 = 72A = 0, 71 # = 72B = 1, and 73^ = 73B = l/\/2. The relay wants to 
first decode the equation with coefficient vector ai = [0 1] T from level A and then the equation with coefficient 
vector bi = [1 1 1] T from level B. Using Theorem 8, this is possible if the message rates satisfy: 

SNR \ 
1 + 3SNR/ 



r 3A < log 1 + 



(136) 



r m < Aogf | + SNR 

Remark 18: It can be shown that nested lattice codes can approach the capacity region of the standard Gaussian 
broadcast problem. See [14] for more details. 

Remark 19: For an application of this superposition scheme to a backhauldimited cellular uplink network, see 
[41]. 



1 



«))" 



1,2,3. 



(137) 



VIII. Slow Fading and Distributed MIMO 

So far, we have considered fixed channel coefficients. Now, we demonstrate that our scheme can be applied to 
the slow fading scenario, even if the transmitters do not know the channel realization. Under a slow fading model, 
the channel matrix H is chosen according to some probability distribution and then remains fixed for all time. As a 
result, we must accept some probability that the rate used by the transmitters is above the maximum rate permitted 
for those channel coefficients. For an achievable strategy with rate -Rscheme(H) for fixed H, this outage probability 
is given by: 

Pout(^) = Ph («scheme(H) < R) (138) 

We can also characterize the performance of a given strategy by its outage rate: 

Rom(p) = sup{i? : pout(^) < p}- (139) 

We will now compare the outage performance of compute-and-forward to the performance of classical relaying 
strategies over a simple network. First, it is useful to note that usually, the exact choice of the coefficient vector 
at a relay is not important, so long as the resulting equations can be solved for the desired messages. Thus, to 
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maximize the performance of our overall scheme, we should have each relay decode the equation that is available 
at the highest rate. 

Consider the two user distributed MIMO network in Figure 7. There are two sources, two relays, and one 
destination. The relays see the transmitters through H whose entries are i.i.d. Rayleigh. Each relay is given a bit 
pipe with rate C bits per channel use to the destination. The destination would like to recover both message wi 
and W2 at highest possible symmetric outage rate. Recall that for a symmetric rate point to be achievable, both 
transmitters must be able to communicate their messages with at least that rate. 



Wi- 



w 2 - 



£1 








H 


£2 


x 2 









Z2 

u 



Y2 



K 2 - 



C 



C 



V — 



Wi 

w 2 



Fig. 7. Two transmitters communicate to a distributed MIMO receiver with two antennas. Each antenna has a rate C bit pipe to the receiver. 



The basic compute-and-forward strategy has each relay decode the equation with the highest rate and pass that 
to the destination. If the equations received by the destination are full rank, decoding is successful. Unfortunately, 
at low SNR, the probability that the equations are not full rank is quite high as shown in Figure 9. One simple 
solution is to force each relay to choose an equation with a mm / 0. This results in equations that are far more 
likely to be solvable at the expense of slightly lower computation rates. The achievable rates for these two strategies 
are given below and are plotted in Figure 8 for C = 2 and outage probability p = 1/4. 

SNR |h> m ' 2 



iW, m =ma X log l||a m || - 1 + SNR||h 




2 




SNR lr;„a m 
^ NZ , m =maxlog N||a m || - ^ - SNR||hJ|2 

u txi\ fmin m i?MAX,m if A is full rank, 
10 otherwise. 

^coMP, N z(H) = (^ i?NZ - ifAisMHank, 
10 otherwise. 

For decode-and-forward, we require that each relay is responsible for a single message. It attempts to recover 
this message either by treating the other message as noise or decoding both messages. The rate for this strategy is 
evaluated below and plotted in Figure 8. For more details on decode-and-forward for multiple relays (as well as 
compress-and-forward and cut-set upper bounds), see [3]. 
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SNRindB 

Fig. 8. Symmetric outage rates for the 2-user distributed MIMO multiple-access channel with i.i.d. Rayleigh fading only known at the 
receivers. Here, we set C = 2 and outage probability p = 1/4. 



/ |/in| 2 SNR \ 

^■ = M 1+ i + IM'snr ) (142) 

^ =^( 1+ iri&) (143) 

^decode,™ = min ^log (1 + |/i m i| 2 SNR) , 
log (1 + |/i„ l2 | 2 SNR) , 

ilog(l + ||h m || 2 SNR)) (144) 

Rii = min(i?ignore,l)-Rignore,2) (145) 

Rid = min(-Ri g nore,l)-Rdecode,2) (146) 

Rdi = mm (-Rdecode,l, -Rignore,2) (147) 

Rdd = min(i?decode,l, -Rdecode,2) (148) 

Rdf(H) = min(max(i? ii , R id , R di , R dd ), C) (149) 



For our upper bound, we use a cut-set bound that either groups the relays with the sources or with the destination. 
This yields the following bound on the symmetric rate: 

i?MiMo(H) = min ^log (l + (|/in| 2 + |/i 21 | 2 )SNR) , 
log(l + (|/i 12 | 2 + |/i 22 | 2 )SNR) , 

^logdet(I + HH*SNR)^ (150) 

iW R (H) = min(i?MiMo(H), C) (151) 

Finally, we consider the performance of compress-and-forward with i.i.d. Gaussian codebooks. The variance of 
the channel observation at relay m is 1 + ||h m || 2 SNR and we have to compress this using C bits. At the destination, 
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Fig. 9. Probability of rank failure for the 2-user distributed MIMO multiple-access channel by having each relay decode the best equation 
and the best non-zero equation. 



one can equivalently write this as a MIMO channel with channel matrix Hcf: 



_ SNR(2C-1) 

SNR CF , m - 2C + SN ^ (152) 



H 



CF 



VSNRcf.i _0 

V SNR CF,2 



H (153) 



Rcf(H) = min(i?MiMo(H C F), C). (154) 

From Figure 8, we can see that compute-and-forward (with the best equation) outperforms all other strategies 
starting at approximately 8dB. It also saturates the bit pipes to the destination at an SNR 5dB less than required for 
decode-and-forward. However, the gains are not as dramatic as observed in Example 4. For non-integer coefficients, 
we can only decode an integer combination and the remainder acts like additional noise. Despite this penalty, 
compute-and-forward is the best strategy in the moderate SNR regime. Compress-and-forward is a good strategy at 
low SNR but since it cannot saturate the bit pipe rate C since it introduces quantization noise. Decode-and-forward 
suffers at high SNR since it treats interference as noise. 

Note that the encoding strategy for compute-and-forward does not depend on the choice of equation coefficients 
at the relay. Therefore, one can obtain the maximum of the best equation rate and the best non-zero equation rate 
with the same strategy simply by disallowing certain coefficients at the relays past an appropriate SNR. 

Remark 20: Since the channel from the transmitters to the relays is essentially a 2-user interference channel, it 
may be useful to have each transmitter send out a public and a private message as in the Han-Kobayashi scheme 
[59]. Such a scheme might improve the performance of both the decode-and-forward strategy and the compute- 
and-forward strategy (by employing superposition as in Section VII). 



IX. Wireless Network Coding 

In our compute-and-forward strategy, messages and equations thereof take values on a finite field, making it 
ideally suited for linear network coding. Rather than choosing the network coding coefficients according to a 
uniform distribution [24], we choose the coefficients to closely match the channel realization. This allows the relay 
to operate at higher computation rates and, as a result, increases the end-to-end performance of the overall network 
code. 

In previous work, we have shown how to build network codes for finite field and Gaussian multiple-access 
networks [19], [27], [28]. Using the coding scheme developed in this paper, these results can be extended to 
include both broadcast constraints and fading. We demonstrate this through an example. 
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For our example, we will use a slight modification of the well-studied two-way relay network [25], [26], [29], 
[30], [60]. In the usual considerations, two users want to exchange messages through a relay that sees a noisy 
combination of their signals. The relay can communicate to the users through a broadcast channel and the goal is 
to maximize the symmetric rate. It is well-known that for the Gaussian two-way relay channel case without fading, 
nested lattice codes rapidly approach the upper bound as the SNR increases [29], [30]. Here, we consider a two-way 
relay channel with i.i.d. Rayleigh fading on all links (see Figure 10). We assume that the channel coefficients are 
only known at the receiving end and evaluate the outage performance as in Section VIII. The two users have power 
constraint SNR and the relay has power constraint SNR B c- 




Fig. 10. Two-way relay channel with fading. Two users exchange messages with the help of an intermediate relay. 

The multicast rate from the relay to two users for a given channel realization is just the minimum of the two 
point-to-point rates: 

R BC = log (l + (min(|/i 3 | 2 , ^ 4 | 2 ))SNR B c) . (155) 

For compute-and-forward, we can guarantee that each user can solve for its desired message by decoding a message 
with non-zero coefficients at the relay. We also let the relay perform successive cancellation to recover the messages 
individually. This allows it to access the corner points of the multiple-access region. For each channel realization, 
the relay selects the strategy that results in the highest symmetric rate. 

p . ((.. ||2 SNRIh^l 2 

-R NZ = max log ||ai|| - 



an,a 12 ^0 ^ l + SNR||hi|| 2 

/ / |/in| 2 SNR \ 
R ^ = mm V + l + |/*i 2 | 2 SNR J ' 

log(l + |/ii 2 | 2 SNR)^ 

iW, 2 = mm (bg ^1 + 1 + M2SNR ) > 
log(l + |/in| 2 SNR)^ 

i?COMp(H) = min ^R BC , max (^R NZ , -Rcorner.l, ^corner, 2^ 

The relay is required to recover both message for decode-and-forward and the rate is simply the maximum 
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symmetric rate in the multiple-access capacity region. 

-^decode = min ^log (l + |/in| 2 SNR) , 
log(l + |/i 12 | 2 SNR), 

^ log (1 + ||hi|| 2 SNR)^ (156) 

i? DF (H) = min ^R B c, ^decode ^ (157) 
The amplify-and-forward scheme has the relay scale its observation to meet the power constraint and retransmit. 

o m , ■ (, A , |^i2^ 3 | 2 SNR SNR B c 
-Raf(H) = mm log 1 



1 + ||hi|| 2 SNR+ |/i 3 | 2 SNR B c/ ' 
|/in/i 4 | 2 SNR SNRbc 



log ^1 + i + || hi ||2 SNR + l/^psNRac. 

Finally, since we are only interested in the symmetric rate, the upper bound is just the minimum over all channel 
strengths. 

«upper(H) = min (# BC ,log (l + |/in| 2 SNR) , 

log(l + |/i 12 | 2 SNR)^ (158) 




Fig. 11. Outage rate per user for the two way relay channel with i.i.d. Rayleigh fading only known at the receivers. Here, we set 
SNRbc = 2SNR and outage probability p = 1/3. 

In Figure 11, we plot the performance of these schemes for SNR B c = 2SNR and outage probability p = 1/3. 
Note that here there is no possibility of rank failure for compute-and-forward and it is always at least as good as 
decode-and-forward. However, unlike the no fading case, compute-and-forward does not approach the upper bound 
as the SNR increases. In fact, past a certain SNR (in this case 25dB), compress-and-forward is the best strategy. This 
is primarily due to the noise penalty for decoding an integer combination from non-integer channel coefficients. 
For the no fading case, the channels coefficients are assumed to be unity and this penalty does not appear. 

The basic insight is that, for certain network topologies, the non-integer penalty can become significant. This 
penalty can be managed in several ways, including: 

• Adjusting the channel at the transmitters if channel state information is available. 
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• Using multiple antennas at the relay to steer the coefficients towards integers [61]. 

• Selecting transmitters opportunistically based on how close their coefficients are to integers. 

However, despite the non-integer penalty, compute-and-forward offers a strict improvement over classical strate- 
gies for reliably recovering equations over a finite field. This, combined with the fact that amplify-and-forward 
requires fairly high values of SNR to overcome noise build-up, makes compute-and-forward a promising strategy 
for physical-layer network coding for multi-hop networks. 

X. Upper Bound 

In this section, we give a simple upper bound on the computation rate through a genie-aided argument. This 
bound does not match our achievable strategy in general and it may be possible to construct tighter outer bounds by 
taking into account the mismatch between the desired function and the function naturally provided by the channel. 

Theorem 9: Assume the channel between the transmitters and the relays is p(yi, . . . , Vm\xi, ■ ■ ■ , If the 
relays, want equations with coefficient vectors ai, . . . , &m € + j^} L , the message rates are upper bounded as 
follows: 

r t < min I(Xf,Y m \X 1 ,...,X e _ 1 ,X e+1 ,...,X L ) 

m:a m i^0 

For the Gaussian channel model considered in this paper, with channel vectors hi, . . . h.M € C L , this specializes 
to: 

r e < min log (l + |/i m£ | 2 SNR) (159) 

m:a m {^0 

Proof: To each relay m for which a m i / 0, we provide all messages except that from encoder t as genie-aided 
side-information. Now, we are left with a multicasting problem from encoder I to all relays with a m i / 0. Clearly, 
the multicast rate is upper bounded by the lowest rate link. For the Gaussian case, it is easy to show that the mutual 
information expressions are maximized by the Gaussian distribution. ■ 

XI. Conclusions 

In this paper, we have developed a new coding scheme that enables intermediate relays to reliably recover 
equations of the original messages by exploiting the interference structure of the wireless channel. As we have 
seen, this framework can achieve end-to-end rates across an AWGN network that are not accessible with classical 
relaying strategies. More generally, the techniques in this paper can be used as building blocks for developing new 
cooperative communication schemes that exploit both the algebraic and statistical properties of wireless networks. 
Here, we discussed applications to distributed MIMO and wireless network coding and we believe there are many 
more scenarios where it will be useful. For instance, in [62], we explored an application to physical-layer gossiping 
in a sensor network. 

Compute-and-forward also adds to the growing pile of evidence that structured codes are a powerful tool for 
tackling problems in multi-user information theory. Recently, many new inner bounds have emerged that take 
advantage of the algebraic structure of multi-user problems. The behavior observed in these strategies is not well- 
captured by the usual cut-set outer bounds. Therefore, new outer bounds that account for algebraic as well as 
statistical structure will be needed to better characterize the capacity regions of multi-user networks [63]. 

Appendix I 
Upper Bound on Noise Densities 

In this appendix, we demonstrate that the densities of the noise terms in Theorem 3 are upper bounded by the 
density of an i.i.d. Gaussian vector. The proof follows that of Lemmas 6 and 11 from [13]. 

Lemma 8: Let z ~ A/"(0, |l nxri ) and let be independently generated according to a uniform distribution over 
V. Also, let cr| denote the second moment of an n-dimensional ball whose radius is equal to the covering radius 
r cov of A and let z^ be independently generated according to jV(0, a B I nxn ). Now, let 

L 

z eq = az + J2°e d e ( 16 °) 
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where a, 0£ £ R. There exists an i.i.d. Gaussian vector 

L 

z* = az + Y J d e**e (161) 

e=i 

with variance a 2 satisfying 

Z V ' EFFEC / Z i=1 

such that the density of z eq is upper bounded as follows: 

f z Jz)<e Lc ^ n U(z) (163) 

c(n) = In f-^-) + I In 2716^ + - (164) 

V r EFFEC J ^ n 

where G B is the normalized second moment of an n-dimensional ball and r EFFEC is the effective radius of A. 

Proof: First, we will show that the density of z eq is upper bounded as desired. From Lemma 11 in [13], we 
have that: 

M t (z)<e c(n)n f z ;(z) (165) 

Since z, di, . . . , dt, are independent, we can write the density of z eq as an n-dimensional convolution of the densities 
of its components: 

/z e ,(z) = /az(z) * /0!di(z) * • • • * fo L d L (z) (166) 

Similarly, we can write the density of z* as: 

/»• (z) = * (z) * • • • * /„ lB . (z) (167) 

Since probability densities are non-negative, we can use the upper bound in (165) to get: 

/az(z) * Wz) < f az (z) * e c ^ n f 9tzl (z) (168) 
Applying this idea L times to f Zcq (z) yields: 

f z Jz)<e Lc ^ n f z *(z) (169) 

We must now upper bound the variance of z*. By Definition 20, Vol(,6(r EFFEC )) = Vol(V). Recall that a ball has 
the smallest second moment for a given volume. Let b be generated according to the uniform distribution over 
£>(r cov ). It follows that 

S -f= 1 -E [||d,|| 2 ] (170) 



>l E 

n 



' EFFEC 



2 



4 (171) 



Finally, we get: 



a 2 = ^[||H| 2 ]+^E^[ll^|| 2 ] (172) 



l 

Z V ' EFFEC / Z 

■ 

Since and -► 1 and — > ^ as n — > oo, c(n) — > as n — > oo. AWGN good lattices have a positive 

error exponentTor i.i.d. Gaussian noise with variance smaller than the second moment of the lattice so this means 
that the probability of error can be driven to zero as the blocklength increases. 
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Appendix II 
Fine Lattices are AWGN Good 

We now show that the fine lattices can recover from i.i.d. Gaussian noise. 

Lemma 9: Ai, A2, . . . , Ax, are AWGN good with probability that goes to 1 as n — ► 00 so long as ^ — ► 0. 
Proof: Let zi denote an i.i.d. Gaussian vector with zero-mean and any variance a 2 such that the volume-to- 
noise ratio for Ai is greater than 27re. Consider the following channel from X£ G V to X£ G V: 

x« = [Q Af (x£ + z*)] mod A (175) 

and let P e g = Pr(x^ / x^). In Appendix B of [13], it is shown that the random coding error exponent for 
this channel (with x^ generated uniformly over V) is equal to the Poltyrev exponent. This means that P e ^ 
decreases exponentially with n for volume-to-noise ratio greater than 27re. Appendix C of [13] shows that the 
same performance is possible if x^ is drawn according to a uniform distribution over {p -1 A} n V and ^ — > 0. 

From Lemma 3, we know that the marginal distribution of each element of Ai n V is uniform over {p _1 A}nv. 
Furthermore, all points in the set A/H V are pairwise independent. This is all that is required to apply the union 
bound and obtain the same performance as i.i.d. inputs over {p~ 1 A} n V in terms of the error exponent. 

Thus, the probability that Ai is AWGN good (with the Poltyrev error exponent) goes to 1 as n — ► 00. It follows 
from a simple union bound that Ai, . . . , Ax are simultaneously AWGN good with high probability as n — ► 00. ■ 



Appendix III 
Fixed Dithers 

We now show that there exist fixed dithers that are appropriate for our coding scheme. We begin by showing 
that with high probability over the dither vectors, the power constraint can be met (since in the proof of Theorem 
3 this is only done in expectation). 

Choose 5 > 0. Scale the second moment of the coarse lattice to be a\ = — 5 ( Teffec ) and consider the 

V r cov / 

real part of the channel input from encoder £, [tf — df] mod A. By Lemma 7, this is uniformly distributed over 
V. Let d be a vector drawr^from the uniform distribution over V and z* be an i.i.d. Gaussian vector with mean 
and variance a 2 = ( ^ COY ) u\. Since z* is i.i.d. Gaussian, by the weak law of large numbers 

V EFFEC / 

2a 4 



a 



n 



> 6 < 



8n 



(176) 



From Lemma 11 in [13], we have that: 



with c(n) as in (164). Note that a 2 
be upper bounded in terms of a 2 : 



/ d (z) < e c ^ n 
2 



SNR 

2 



/z«(z). (177) 
— 5 so the probability that d violates the power constraint can 



> 



SNR\ 



n 





11 j 11 2 2 

— d — a 




n 



> 5 



< e 



c{n)n 



^1 

5n 



(178) 
(179) 



This goes to as n goes to infinity so, by the union bound, all channel inputs obey the power constraint with high 
probability. Since A is covering good and the rates are continuous in the second moment of the coarse lattice, we 
can choose a decreasing sequence of 5 that approaches the rates achieved when a\ = 

We note that the probability of decoding error in Theorem 3 goes to zero as n increases when using random 
dithers. Taking a union bound over this error event and the event that any dither exceeds a power constraint, we 
get that for n large enough there is (at least) one good fixed set of dither vectors. 
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